Auditory scene analysis based on time-frequency integration of shared FM and AM

نویسندگان

  • Mototsugu Abe
  • Shigeru Ando
چکیده

This paper describes a new method for computational auditory scene analysis which is based on 1) waveform operators to extract instantaneous frequency (IF), frequency change (FM), and amplitude change (AM) from subband signals, and 2) a voting method into a probability distribution to extract coherency (shared fundamental frequency, shared FM, and shared AM) involved in them. We introduce non-parametric Kalman filtering for the time-axis integration. A consistent AM operator which is independent to frequency change is newly defined. Sharpness of the resultant probability distribution is examined with relating to the definition of the operators and subband bandwidth. We evaluate the performance of the algorithm by using several speech sounds.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Auditory scene analysis based on time-frequency integration of shared FM and AM (I): Lagrange differential features and frequency-axis integration

カクテルパーティ効果 [1]として知られるように, 人間は混合音環境下においても高い音響認識力を有す る.近年このような聴覚機能に関して,聴覚情景解析 (auditory scene analysis) [2]という枠組のもと,知覚 の心理学と計算論の双方から統一的な接近が進められ ている.特に,Bregman [2]は,知識を利用しない低 次の分離能力に関して,心理実験に基づき,1) 音響信 号はスペクトログラムに似た多数の要素に分解される こと,2) 同じ音源から発せられた要素がグループ化さ れてストリームを形成すること,3) グループ化のさ れやすさ(分凝要件)は,周波数の調和的関係,共通 の立ち上がり時刻,共通の周波数変化,共通の振幅変 化,成分の連続性,時間周波数の近接性,共通の音源 位置,などに関係していることを指摘した.一方,混 合音の分離の工学的研究は,Kaiser ...

متن کامل

Neural representations of complex temporal modulations in the human auditory cortex.

Natural sounds such as speech contain multiple levels and multiple types of temporal modulations. Because of nonlinearities of the auditory system, however, the neural response to multiple, simultaneous temporal modulations cannot be predicted from the neural responses to single modulations. Here we show the cortical neural representation of an auditory stimulus simultaneously frequency modulat...

متن کامل

AM-FM Separation Using Auditory-motivated Filters - Speech and Audio Processing, IEEE Transactions on

An approach to the joint estimation of sine-wave amplitude modulation (AM) and frequency modulation (FM) is described based on the transduction of frequency modulation into amplitude modulation by linear filters, being motivated by the hypothesis that the auditory system uses a similar transduction mechanism in measuring sine-wave FM. An AM-FM estimation algorithm is described that uses the amp...

متن کامل

Dissociable Neural Response Signatures for Slow Amplitude and Frequency Modulation in Human Auditory Cortex

Natural auditory stimuli are characterized by slow fluctuations in amplitude and frequency. However, the degree to which the neural responses to slow amplitude modulation (AM) and frequency modulation (FM) are capable of conveying independent time-varying information, particularly with respect to speech communication, is unclear. In the current electroencephalography (EEG) study, participants l...

متن کامل

Speech recognition with amplitude and frequency modulations.

Amplitude modulation (AM) and frequency modulation (FM) are commonly used in communication, but their relative contributions to speech recognition have not been fully explored. To bridge this gap, we derived slowly varying AM and FM from speech sounds and conducted listening tests using stimuli with different modulations in normal-hearing and cochlear-implant subjects. We found that although AM...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998